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Bilinguals have been shown to outperfornn monolinguals in a variety of tasks that 
do not tap into linguistic processes. The origin of this bilingual advantage has been 
questioned in recent years. While some authors argue that the reason behind this apparent 
advantage is bilinguals' enhanced executive functioning, inhibitory skills and/or monitoring 
abilities, other authors suggest that the locus of these differences between bilinguals and 
monolinguals may lie in uncontrolled factors or incorrectly matched samples. In the current 
study we tested a group of 180 bilingual children and a group of 180 carefully matched 
monolinguals in a child-friendly version of the ANT task. Following recent evidence from 
similar studies with children, our results showed no bilingual advantage at all, given that 
the performance of the two groups in the task and the indices associated with the 
individual attention networks were highly similar and statistically indistinguishable. 

Keywords: bilingual advantage, inhibitory skills, executive control, attention, ANT task 



INTRODUCTION 

The so-called "bilingual advantage" (KroU and Bialystok, 2013), 
broadly understood as enhanced executive cognitive control for 
bilinguals as compared to monolinguals, has attracted very much 
interest in recent years. Different hypotheses have been proposed 
to account for this bilingual advantage, all of which predict that 
bilingual individuals will perform better than their monolingual 
peers in processing incongruent or salient irrelevant information. 
While there has been considerable evidence to date support- 
ing a bilingual advantage, very recently there has also been an 
increase in the number of studies showing a similar performance 
of bilinguals and monolinguals in non-linguistic executive con- 
trol tasks. The present study provides data collected from a large 
sample of carefully matched bilinguals and monolinguals suggest- 
ing that the so-called bilingual advantage is not generalizable and 
replicable when the samples are properly controlled. 

One of the most commonly studied tasks in which bilinguals 
have been claimed to outperform monolinguals is the classic 
Stroop task (Stroop, 1935). In this task, participants have to 
name the color in which target words are printed. The difference 
between the latencies to incongruent trials (i.e., the target word 
to be named is the name of a color and is printed in a differ- 
ent ink color; e.g., the word "green" printed in red color) and the 
latencies to congruent trials (i.e., target word and its color match; 
e.g., the word "green" printed in green) is the Stroop effect, and 
is an index of inhibitory control. The Stroop effect was found 
to be smaller in bilingual participants than in their monolin- 
gual peers and this difference has been claimed to be especially 



evident in older bilinguals when compared to their monolingual 
counterparts (e.g., Bialystok et al., 2008; Hernandez et al., 2010). 
However, as we will explain below, recent results have challenged 
these findings showing negligible differences between bilinguals 
and monolinguals in the Stroop task (Dufiabeitia et al., 2014). 

Evidence in favor of the so-called bilingual advantage has been 
also obtained using the Simon paradigm (Simon and Rudell, 
1967). In this task, participants have to respond with either their 
left or right hand depending on one specific feature of the stim- 
ulus (e.g., the color), while ignoring other salient but apparently 
irrelevant features of the target (e.g., its location). The Simon task 
includes congruent and incongruent conditions, as a function of 
the match between the relevant and irrelevant features. The differ- 
ence between congruent and incongruent trials (the Simon effect) 
has been typically found to be smaller in bilinguals than in mono- 
linguals (Bialystok et al., 2004). Again, as with the Stroop task, 
this bilingual advantage has been found to be much stronger in 
older than in younger adults (Bialystok et al., 2004). However, as 
in the case of the Stroop task, recent studies have also reported 
negligible differences between bilinguals and monolinguals in the 
Simon task (see Prior and MacWhinney, 2010; Humphrey and 
Valian, 2012; Kousaie and Phillips, 2012; Kirk et al, 2013; Paap 
and Greenberg, 2013; Sawi and Paap, 2013; Gathercole et al., 
2014). 

Another task extensively used in the attention domain to show 
the bilingual advantage is the Attentional Network Test (ANT; 
Fan et al, 2002). This task, which is a combination of the clas- 
sic flanker task (Eriksen and Eriksen, 1974) and the cueing task 
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(Posner, 1980), measures the three independent attentional net- 
works of Orienting, Alerting and Executive control (e.g.. Fan 
et al, 2003). In this task, participants need to respond to the pres- 
ence of an arrow on the screen, by indicating whether the arrow 
is pointing to the left or to the right. The critical arrow (e.g., —^) 
can be flanked by another 2 arrows on each side, either pointing 
in the same direction (Congruent trials; e.g., —^—^—^—^—^) or 
in the opposite direction (Incongruent trials; e.g., -^-^ — >•■«—-«—). 
Simple lines can also flank the central arrow, this way creating 

the Neutral condition (e.g., > - -)■ Previous to each flanker 

trial and after a random time period, participants can be cued 
about the position where the arrows are going to appear, since 
the arrows can appear either in the upper or in the lower part 
of the screen. The Cue factor can be manipulated so that partici- 
pants see a valid Spatial Cue (i.e., an asterisk in a congruent cueing 
position), a Double Cue (i.e., one asterisk in the upper part and 
another one in the lower part), a Neutral Cue (an asterisk in the 
middle of the screen) or No Cue at all. With the combination of 
these 4 cue conditions (Double, Spatial, Center and No cue) and 3 
flanker conditions (Congruent, Incongruent and Neutral), a mea- 
surement of the three attentional networks can be obtained. The 
index of the Alerting Network can be obtained by subtracting the 
reaction times in the Double Cue condition and the ones in the 
No Cue condition. Similarly, the Orienting index can be obtained 
by comparing the Central Cue and the Spatial Cue conditions. 
Finally, and possibly the most important for our purposes, the 
Conflict Effect, which is closely related to executive control, can 
be obtained by comparing the reaction times to Incongruent and 
Congruent trials. 

In the Revised ANT task (ANT-R, Fan et al., 2009) a fifth 
cueing condition was created: the Invalid Spatial Cue. This was 
conceived as the opposite of the Valid Spatial Cue, which precedes 
the target stimuli in its exact same position. The Invalid Spatial 
Cue precedes the target arrow in the opposite part of the screen, 
so that an asterisk in the lower part would precede targets appear- 
ing in the upper part of the screen, and an asterisk in the upper 
part would precede targets appearing in the lower part. By com- 
paring the (longer) latencies to the Invalid Cue condition to the 
(shorter) reaction times to the Valid Cue trials, the Validity index 
is obtained, considered as an index of reorienting attention. 

The ANT task has been found to show a different developmen- 
tal pattern for the different networks. Rueda et al. (2004), tested 
children from 6 to 10 years of age in an adapted version of the 
ANT task where the arrows were replaced with fishes to make 
it more child-friendly. Not surprisingly, they found that overall 
reaction times and error rates decreased gradually as a function 
of age. When the Alerting, Orienting, and Conflict networks were 
analyzed separately, the authors found that the developmental 
pattern was not parallel for these three networks. On the one 
hand, the Alerting network showed negligible changes between 
ages 6 and 10. Similarly, the Orienting network failed to show a 
clear-cut developmental change. In contrast, the Conflict effect 
showed a remarkable improvement from age 6 to age 7, remaining 
relatively stable after that. 

Similarly to the Stroop and the Simon tasks, when the ANT 
task has been used to explore differences between bilinguals 
and monolinguals, an intriguing pattern has been found. For 



instance, Costa et al. (2008) tested Catalan-Spanish bilinguals 
and compared them to their monolingual peers. When looking 
at the specific attention networks, they found that monolinguals 
showed larger Conflict effects than bilinguals. Besides, in the 
Alerting network, bilingual participants showed larger benefits 
than monolinguals due the presence of an Alerting Cue. They 
also reported that bilingual participants were overall faster than 
their monolingual peers regardless of the Flanker and Cue type, 
and they showed that the overall RT differences could not be sim- 
ply explained by bilinguals just being better than monolinguals 
at conflict resolution, given that they were also faster in con- 
gruent trials. Taken together, these results led them to abandon 
the hypothesis that the bilingual advantage was the consequence 
of bilinguals' better ability to process incongruent information, 
and to propose that it reflected bilinguals' enhanced monitoring 
abilities. 

To further test this hypothesis, Costa et al. (2009) ran a ver- 
sion of the ANT manipulating the monitoring demands using 
different groups of bilingual and monolingual participants. In 
a first experiment they created a low-monitoring context, with 
92% of the trials belonging to one condition (either Congruent 
or Incongruent) and 8% to the other condition, thus making the 
condition of the upcoming target highly predictable. In a second 
experiment, they created two high-monitoring contexts. In one 
of the contexts, each condition (i.e.. Congruent and Incongruent) 
was represented by 50% of the trials, making it difficult to pre- 
dict the condition of the individual trial. In the other context, the 
authors opted for a 75% congruent-25% incongruent distribu- 
tion of the trials. Costa et al. found that bilingual participants 
were overall faster than monolinguals in the highest monitor- 
ing context (namely, 50% of the trials per condition), but did 
not show differences in the magnitude of the Conflict effect. 
Contrarily, in the low-monitoring context, both groups behaved 
similarly, with no differences in overall RTs or in the magnitude 
of the Conflict effect. In the 75-25% context a slight advantage 
was found in overall RTs and in the Conflict effect for bilin- 
guals, but these effects were modest and exclusively confined to 
the first experimental block. Hence, the results reported by Costa 
et al. suggest that (1) the so-called bilingual advantage does not 
seem to be exclusively related to an enhancement of bilinguals' 
inhibitory skills (Green, 1998; Bialystok et al, 2004; KroU et al, 
2008; and see also Morales et al, 2013 for an explanation combin- 
ing inhibitory and monitoring skills), and that (2) the appearance 
of the bilingual advantage seems to be restricted to certain exper- 
imental conditions, often failing in its replication (e.g.. Prior 
and MacWhinney, 2010; Kousaie and Phillips, 2012; Paap and 
Greenberg, 2013). 

Clearly at odds with these findings reported by Costa et al. 
(2009), a recent study by Pelham and Abrams (2014) testing 
young adults who were early bilinguals, late bilinguals or mono- 
linguals in the ANT showed a significant bilingual advantage in 
conflict resolution. They found that monolinguals were slower 
than the two bilingual groups in incongruent trials, showing 
larger conflict effects than both late and early bilinguals (with no 
differences between the last two). 

Although the main focus of bilingualism research using the 
ANT task has been the Conflict effect, given its direct relationship 
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with executive control and its implications for the bilingual 
advantage based on inhibitory skills; it is worth noting that there 
has also been evidence of differences in the Alerting effect (Costa 
et al., 2008; but see Costa et al, 2009) and in the Orienting net- 
work (Colzato et al, 2008; but see Hernandez et al., 2010). Clearly, 
it is difficult to extract a take-home-message from the bulk of evi- 
dence gathered from ANT studies with bilingual and monolingual 
adult samples, given the high degree of variance in the observed 
results. 

Leaving aside the debate about critical experimental settings, 
tasks or contexts that lead to the appearance or vanishing of the 
bilingual advantage, it is worth noting that the strongest pieces of 
evidence supporting it come from adult research and especially 
from research done with elder adults. However, this bilingual 
advantage is more elusive in research with children and the num- 
ber of discrepant studies of this type has increased in recent years. 
Curiously, it should be mentioned that even researchers show- 
ing differences between bilingual and monolingual adults admit 
that the evidence in favor of a bilingual advantage in children is 
certainly limited (Bialystok et al., 2010, 2012; see also Hilchey 
and Klein, 2011, for a review). Furthermore, it has been sug- 
gested that some factors other than the mere linguistic profile 
of the participants may play a very important role in the emer- 
gence of the bilingual advantage in different tasks. For instance, 
Morton and Harper (2007) tested a group of bilingual and mono- 
lingual children in a Simon task and they found no differences in 
their performance as a function of the number of languages they 
knew. Instead, they found a significant correlation between their 
socio-economic status (SES) and their performance in the task, 
arguing that the SES, not bilingualism, was the crucial factor in 
producing the effect. Hence, the number of intra-experimental 
and external factors that seem to have a direct impact on the 
appearance (and the magnitude) of the bilingual advantage is 
increasing, and the true nature of bilingual outperformance in 
executive control tasks remains unclear, casting doubts on some 
of the claims that have lead the field in the last decade. In this 
line, Paap and Greenberg (2013) recently reported that the stud- 
ies which have failed to obtain a bilingual advantage should not 
be ignored. They noted that many of the studies showing a bilin- 
gual advantage could possibly be showing a Type I error, due to 
inadequately matched or very small groups, uncontrolled exter- 
nal factors or task-dependency effects. They concluded that the 
replicability and the cross-study reliability of this advantage are 
markedly low. 

Following this line of reasoning, in a recent study, Dunabeitia 
et al. (2014) compared the performance of a group of more than 
250 bilingual children to that of a group of very well matched 
monolinguals in both the classic Stroop task and the Numerical 
Stroop task (a variation of the classic task with minimal involve- 
ment of language). Following the claims raised, among others, 
by Paap and Greenberg (2013) and Morton and Harper (2007), 
Dunabeitia et al. carefully matched participants for age, read- 
ing and mathematical abilities, and verbal and non-verbal IQ, 
together with some socio-economic indicators. In a series of dif- 
ferent analyses, Dunabeitia et al. found no signs of a difference 
in the performance of these two groups. These findings lead 
the authors to conclude that the so-called bilingual advantage 



in executive control tasks seems to be inexistent in children. 
Nonetheless, as they acknowledged, further research is needed in 
order to shed light on the replicability of the bilingual advantage 
across tasks. 

These conclusions are also endorsed by a recent study by 
Gathercole et al. (2014), who tested a large number of Welsh 
children and adults in different tasks (« = 650 in a card sorting 
task, n = 557 in the Simon task and n = 354 in a grammatical- 
ity judgment task). The different groups tested included English 
monolinguals and bilinguals coming with different degrees of use 
of Welsh and English (i.e., bilinguals who only spoke Welsh at 
home, bilinguals who used both Welsh and English at home, and 
bilinguals coming from English-speaking homes). Importantly, 
Gathercole et al. found no evidence for a bilingual advantage. 
No differences were found in the switch cost or overall perfor- 
mance in the card sorting task. Similarly, negligible differences 
were found in the Simon task. The grammaticality judgment task 
also failed to reveal any systematic bilingual advantage. 

Considering the lively debate about how bilingualism may 
affect performance in the ANT task, in the current study we tested 
a group of 360 children (180 bilinguals, 180 monolinguals) of dif- 
ferent ages in a child-friendly version of the ANT (see Rueda et al, 
2004). Similarly to the careful matching of the participants tested 
in the study by Dunabeitia et al. (2014), special care was taken to 
avoid the influence of uncontrolled factors in the data observed. 
Following the inconsistent results obtained in the ANT with adult 
participants (see Costa et al, 2009; Pelham and Abrams, 2014), 
the absence of a bilingual advantage in the study with children 
presented by Duflabeitia et al. (2014), and the results reported by 
Rueda et al. regarding the different development of the attention 
networks as a function of age, here we investigated (1) whether 
there is a bilingual advantage in children in any of the attention 
networks, and (2) whether the development of these networks is 
similar or different for bilingual and monolingual children. 

METHODS 
PARTICIPANTS 

Two groups of participants were recruited from different schools 
in Spain (« = 360, females = 211). The first group was made up 
of 180 Spanish monolingual children (females = 106) from sec- 
ond, third, fourth and fifth grades of elementary school and grade 
one from secondary school. These monolinguals were recruited 
from Spanish schools in places where Spanish is the only official 
language, and none of them had fluent knowledge of any other 
language than Spanish. Also, none of them corresponded to any 
immigrant minority and they were only exposed to Spanish at 
home. The second group was formed by 180 bilingual children 
(females = 105) from the same grades who were born and lived 
in the Basque Country. The Basque Country is a Spanish region 
where two languages, Basque and Spanish, are co-official. All 
these bilingual children were attending bilingual schools where 
both languages were used as vehicular languages. According to 
the legal requirements, bilingual schools in the Basque Country 
ensure that teachers switch from one to the other language as 
they switch academic subjects, making sure of a similar distribu- 
tion of the languages across subjects and school time (50% in each 
language). This way, Basque children attending bilingual schools 
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are exposed actively to the two languages on a daily basis dur- 
ing schooling. A linguistic competence questionnaire filled in by 
171 of the 180 bilingual children's parents (namely, 95% of the 
sample) showed that bilingual participants had acquired the two 
languages very early in life, with overall age-of-acquisition scores 
of 0.58 years {SD = 0.77) for Spanish and of 2.23 years (SD = 
1.07) for Basque. The parents' subjective ratings for the children's 
performance in Basque and Spanish were collected on a 0-to-lO 
scale, where 10 represented a perfect knowledge and use of the 
language. Children's mean proficiency scores in Spanish was 8.65 
(SD = 1.17), and their score in Basque was 5.96 (SD = 1.63). 

The reason for selecting samples of children instead of adult 
samples is twofold. First, considering the idiosyncrasy of the bilin- 
gual educational system in the Basque Country (see above), a 
relatively high degree of control of children's use of the two 
languages can be applied. Simply by checking their academic syl- 
labus and the language in which each subject is being taught, 
daily exposure to both languages can be ensured. And second, 
considering that the most reliable pieces of evidence supporting 
the so-called bilingual advantage have been obtained for indi- 
viduals that are not at ceiling level in their executive functions 
(e.g., elderly), it could be tentatively suggested that any difference 
between bilinguals and monolinguals should also emerge in sam- 
ples of individuals who have not reached yet a fully developed 
attentional system (e.g., children). The different cognitive and 
executive skills develop progressively during childhood, and while 
some of them are relatively mature around age 12-13, many other 
executive processes are only fully developed or established during 
mid- adolescence or adulthood (see Anderson, 2002, for review). 

In order to explore the developmental trajectory of the atten- 
tion networks, we divided the sample of bilinguals and mono- 
linguals into three evenly distributed subgroups. Monolingual 
and bilingual 2nd and 3rd graders were classified as Group 
1, 4th and 5th graders were classified as Group 2, and 6th 
graders and students from the first high school grade were clas- 
sified as Group 3. 120 children were included in each group, 
half of them (n = 60) corresponding to a monolingual environ- 
ment and the other half corresponding to a bilingual context. 
Pairwise comparisons within each group showed no differences 
(allps > 0.11) between bilinguals and monolinguals in age, gen- 
der, overall reading and arithmetic skills (as assessed by their 
teachers on a l-to-5 Likert scale), verbal, non-verbal and com- 
posed IQ [obtained from the Spanish version of the Kaufman 
Brief Intelligence Test (1990), K-BIT], income at home (clas- 
sified according to the following categories: >3000€/month, 
category 1; 2001-3000€, category 2; 1601-2000€, category 3; 
1201-1600€, category 4; 750-1200€, category 5 and <750€ cat- 
egory 6), number of years of formal education of the parents, 
and parental work status (including three possible categories: 
neither works, only one of them works, both of them work). 
Furthermore, we made sure that none of the participants had 
any specific developmental, psychological, psychiatric, or educa- 
tional disorder, deficit, or special need by including a series of 
questions in this regard in the questionnaires completed by par- 
ents and teachers. Besides, none of the children had repeated any 
academic year and no chQd with scores below the 20th centile in 
verbal, non-verbal, and combined IQ tests was included in the 
sample. Hence, the two groups were carefully matched in many 



socio-economic and cognitive measures (see Table 1 for detailed 
comparisons). 

DESIGN 

In this version of the child Attention Network Test (ANT) two 
within-subject factors were manipulated. Cue type (Double Cue, 
Valid Cue, Invalid Cue, Neutral Cue and No Cue) and Flanker 
type (Incongruent, Congruent), leading to a total of 10 condi- 
tions. As already explained in the Introduction, Fan et al. (2009) 
suggested that the inclusion of an index of validity within the cue- 
ing conditions provides an additional measure of the ability to 
reorient attention. Hence, valid and invalid cues were included 
in the current design too. The cueing manipulations were cre- 
ated by presenting (or not) an asterisk on the screen prior to the 
presentation of the target strings. These cues could be presented 
at the same position of the upcoming target (Valid condition), 
or in the opposite position (Invalid condition). In order to cre- 
ate the Double Cue condition, two asterisks were presented at 
the same time above and below the center of the screen. The 
Neutral Cueing condition was created by presenting the aster- 
isk at the center of the screen, and the No Cue condition was 
created by not providing any visual cue. Regarding the flanker 
manipulation, the target was a left- or right-pointing yellow fish 
(1.6°), presented above or below the fixation cross. This central 
fish was flanked on both sides by two fishes pointing either in the 
same direction (Congruent trials), or in the opposite direction 
(Incongruent trials). The distance between the fishes was 0.21°. 
The target and flankers subtended 8.84° and were presented 1° 
above and below the fixation cross over a blue-green background. 
For detailed description of the stimuli and procedure, see Rueda 
etal. (2004). 

PROCEDURE 

All the stimuli were presented on a computer screen. Each trial 
began with a fixation cross (1° of visual angle) with a random 
duration between 400 and 1600 ms. Then a cue (an asterisk) 
could appear in any of its variants (see below) for 150 ms. Next, a 
centered fixation cross appeared on the screen for 450 ms, imme- 
diately followed by the target and flanker stimuli. The target string 
stayed on the screen until a response was given or for a maximum 
of 1700 ms. After each trial, feedback was provided. 

A session of the ANT consisted in a total of 288 trials. Each 
trial represented one of the 10 conditions mentioned above (Cue 
type X Flanker type). To keep the high-monitoring demanding 
context, 50% of the trials belonged to the Congruent condi- 
tion and the other 50% to the Incongruent condition. Regarding 
each cueing condition, there were 72 Double Cue, 48 Valid, 48 
Invalid, 48 Neutral Cue and 72 No Cue trials. Participants were 
seated at a distance of about 55 cm from the screen and they were 
instructed with a series of practice trials to indicate the direction 
of the central fishes of the strings, pressing the "L" key in the key- 
board for right responses or the "S" key for left responses. Both 
accuracy and reaction times were recorded in each experimental 
trial. 

DATA ANALYSIS 

Reaction times below 200 ms (only representing 0.12% of the 
data) were excluded. Reaction time data was trimmed by using 
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the classic 2.5SD criterion, resulting in the exclusion of the 2.49% 
of the data, and RTs associated with erroneous responses were not 
included in the latency analyses. Before focusing on the individual 
indices for each attention network, all the conditions were ana- 
lyzed in a general ANOVA including Cue Type (No Cue, Valid 
Cue, Invalid Cue, Double Cue and Neutral Cue) and Flanker 
Type (Congruent and Incongruent) as within-participant fac- 
tors, and Language (Bilinguals and Monolinguals) and Group 
(First, Second and Third group) as between-participants factors. 
In subsequent analyses we looked at the different attention net- 
works by measuring the following indexes: the difference between 
Congruent and Incongruent trials as a reflection of executive con- 
trol (Conflict effect), the differences between the Double Cue and 
the No Cue conditions for the alerting network (Alerting effect), 
the orienting network as measured by the difference between the 
trials with a Neutral Cue and trials with a Valid Cue (Orienting 
effect), and finally the difference between the trials with a Valid 
Cue vs. the trials with an Invalid Cue as markers of the Validity 
effect. Detailed information about the RT and error data is 
presented in Table 2. 

RESULTS 

GENERAL ANALYSES 

In the RT analysis, we found significant main effects of Flanker 
Type 354) = 1624.68, MSE = 1993.35, p < 0.01], Cue Type 
[f (4 i4ig) = 237.19, MSE = 1298.75, p < 0.01] and Group 
[F(2, 354) = 120.07, MSE = 66486.08, p < 0.01]. In contrast, the 
main effect of Language was not significant [_F(i_ 354) = 2.22, MSE 
= 66486.08, p > 0.13]. The 2-way interaction between Flanker 
Type and Group was significant [f(2, 354) = 12.5, MSE = 1993.35, 



p < 0.01], and the same was true for the interaction between 
Flanker Type and Cue Type [_F(4, 1416) = 24.12, MSE = 893.76, 
p < 0.01]. None of the other interactions was significant. 

In error rate analysis, both Language groups performed simi- 
larly (_F < 1). The main effects of Flanker Type [f (i_ 354) = 303.20, 
MSE = 35.25, p < 0.01], Cue Type 1416) = 11.52, MSE = 
17.61, p < 0.01], and Group [f(2, 354) = 43.53, MSE = 210.73, 
p < 0.01] were significant. The only significant interactions 
found were the Flanker Type * Group interaction [_F(2, 354) = 
6.85, MSE = 35.25, p < 0.01], and the Flanker Type * Cue Type 
interaction [f(4^ 1415) = 90.32,MS£= 17.44,p < 0.01]. 

Thus it is important to notice that none of the interactions 
with Language were significant, showing that the same effects 
hold for bilinguals and monolinguals. 

THE THREE AHENTIONAL NETWORKS 

Considering the reliable Flanker Type * Cue Type interactions, 
and following preceding research, we explored each of the effects 
mentioned above individually (i.e.. Conflict, Alerting, Orienting 
and Validity), and the manner in which the between-participants 
factors Group and Language could modulate them (see Table 3 
and Figure 1 for comparisons between Language groups; and see 
Table 4 and Figure 2 for a detailed comparison between Language 
Groups in each Age Group). 

Executive network: the Conflict effect 

In the RT analysis, the Conflict effect as measured by the fac- 
tor Condition (Congruent vs. Incongruent trials) was signifi- 
cant [_F(i, 354) = 1624.68, MSE= 398.67, p < 0.01], as well as 
the main effect of Group [f(2, 354) = 120.07, MSE = 13297.22, 



Table 2 | Reaction times and error rates to each condition. 



Conditions 



Double Cue Neutral Cue Valid Cue Invalid Cue No Cue Congruent Incongruent Total 

Mean SD Mean SD Mean SD Mean SD Mean SD Mean SD Mean SD Mean SD 



REACTION 

Bilinguals 690.30 110.13 705.87 112.55 672.67 107.13 724.39 103.20 714.49 108.34 670.88 104.01 732.21 109.47 701.55 105.75 
Monolinguals 676.12 101.31 692.86 111.44 659.59 106.71 711.09 108.94 703.99 105.34 659.41 103.97 718.05 106.95 688.73 104.48 

ERROR RATES ^jHIHHHI^^^^^^^^^Hi^lHIHHHHHHHHHHIHI^Hll^H^mHHil 

Bilinguals 4.92 5.40 4.92 5.41 4.56 5.30 5.91 6.30 5.69 5.61 3.45 4.58 6.95 5.90 5.20 4.91 

Monolinguals 4.58 5.64 4.99 5.72 4.20 5.84 5.60 6.43 5.02 5.68 3.18 4.78 6.57 6.30 4.88 5.28 



Table 3 | Attentional networl<s, measured as the difference in reaction times and error rates. 

Attentional networks 

Conflict index Orienting index Alerting index Validity index 

Mean SD Mean SD Mean SD Mean SD 





Bilinguals 


61.34 


29.51 


33.20 


39.85 


24.19 


32.82 


51.72 


41.95 


Monolinguals 


58.64 


28.94 


33.27 


38.50 


27.87 


30.76 


51.50 


42.78 


FRROR RATFS 


Bilinguals 


3.50 


3.91 


0.36 


4.22 


0.76 


4.00 


1.35 


4.51 


Monolinguals 


3.39 


3.71 


0.79 


3.77 


0.43 


4.09 


1.40 


4.28 
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p < 0.01] and the interaction between them [_F(2, 354) = 12.50, 
MS£ = 398.67, p < 0.01]. It took longer for participants to 
respond to the Incongruent trials as compared to the Congruent 
ones, and participant speed of response increased as a function of 
age (see below). Importantly, the main effect of Language was not 
significant 354) = 2.22, MSE = 13297.22, p > 0.13], and it 
did not interact with Condition (_F < 1) or with Group (F < 1). 
The 3-way Language* Condition* Group interaction was not sig- 
nificant [F(2, 354) = 2.22, MSE = 398.67, p > 0.11]. Hence, we 
can conclude that monolinguals and bUinguals showed highly 
similar Conflict effects. 

In order to explore the origin of the significant 
Condition* Group interaction, follow-up contrasts were run 
collapsing the data across linguistic profiles. Pairwise contrasts 
showed that the differences in the responses to the two types 
of flankers (Congruent, Incongruent) decreased with age. Thus, 
when comparing the Conflict effect in each Group, we observed 
that the first group showed the largest Conflict effect (average 
of 70 ms), and that this effect progressively diminished with age 
(Group 2 = 57 ms; Group 3 = 52 ms). Pairwise tests showed that 
the effect was significantly larger for Group 1 than for Group 
2 and Group 3 [Group 1 vs. Group 2: f{238) = 3.18, p < 0.01; 
Group 1 vs. Group 3: t(238) = 4.54, p < 0.01], while the differ- 
ence was not significant between Groups 2 and 3 [f(238) = 1-70, 
p < 0.1]. 

In error rate analysis only the main effects of Condition 
[P{i, 354) = 303.20, MSE = 7.05, p < 0.01] and Group 
[F{2, 354) = 43.53, MSE = 42.15, p < 0.01] were significant. 



Attentional networks in Language Groups 



70 




□ Bilinguals 
■ Monolinguals 



Conflict Validity Orienting Alerting 
Indexes 



FIGURE 1 I Comparison of indexes across Language Groups. 



The only significant interaction was found between Condition 
and Group 354) = 6.85, MSE = 7.05, p < 0.01]. Replicating 
the RT data, the error data showed a clear Conflict effect, with 
higher error rates in incongruent than in congruent conditions 
and a modulation of the percentages of errors as a function of age 
(i.e., overall error rates diminished as a function of age). Given 
the significant interaction, we can conclude that the magnitude 
of the Conflict effect decreased as a function of age. Importantly, 
the Language effect and the interactions between this and the 
other factors were negligible (allfs < 1 and allps > 0.5). 

Alerting network: the Alerting effect 

When considering the differences in RTs between the Double 
Cue and the No Cue conditions, only the main effects of 
Condition 354) = 239.44, MSE = 509.37, p < 0.01] and 
Group [P(2, 354) = 118.55, MSE = 13364.56, p < 0.01] were sig- 
nificant. The Language effect was not significant Jf(i 354) = 2.05, 
MSE = 13364. 56, p > 0.15]. None of the interactions were signif- 
icant (Fs < 1.20, ps > 0.27]. Hence, participants responded faster 
to Double Cue trials than to No Cue trials and they became overall 
faster as their age increased but the difference between the cueing 
conditions did not differ across ages or across language profiles. 

In the error rate analysis, the only significant effects corre- 
sponded to the factors Condition [_F(i 354) = 7.81, MS-E=8.25, 
p < 0.01] and Group [f(2, 354) = 41.25, MS£ = 44.43, p< 
0.01], showing that participants made more errors in No Cue 
trials than in Double Cue trials and that the number of errors 
decreased as a function of age. No other effects or interactions 
were significant (all _Fs < 1.1 and allps > 0.3). 

Orienting network: tite Orienting effect 

The Orienting effect (i.e.. Valid Cue vs. Neutral Cue) was sig- 
nificant [_F{i^ 354) = 260.30, MS£= 763.89, p < 0.01], as was the 
main effect of Group [P{2, 354) = 109.45, MSE = 14488.40, p < 
0.01]. Responses to trials with a Valid Cue were faster than 
responses to trials with a Neutral Cue and averages RTs decreased 
as a function of age. In contrast, the main effect of Language was 
not significant 354) = 2. 12, MS£= 14488.40,p > 0.14], and 
none of the interactions involving the factor Language was sig- 
nificant (all _Fs. < 1). A marginal interaction between Condition 
and Group was found [_F(2, 354) = 2.84, MS£= 763.89, p < 0.07], 
suggesting that the magnitude of the Orienting effect decreased 
with age. Follow-up pairwise contrasts showed similar Orienting 
effects for Groups 1 and 2 (39 and 34 ms, respectively; f < 1), 
and a significantly smaller effect for Group 3 [27 ms; Group 1 vs. 



Table 4 | Latency differences in attentional networl<s in each age group. 

Conflict Orienting Alerting Validity 

effect effect effect effect 

Age Group Bilinguals Monolinguals Bilinguals IVIonolinguals Bilinguals Monolinguals Bilinguals Monolinguals 

Group 1 73.53 (36.21) 66.63 (35.81) 38.02 (49.51) 39.59 (49.61) 21.71 (41.38) 30.06 (41.01) 51.56 (50.14) 56.76 (4772) 

Group 2 54.44 (21.51) 60.60 (26.15) 34.12 (38.12) 33.95 (35.92) 23.77 (32.63) 25.18 (20.58) 54.47 (43.28) 58.27 (43.66) 

Group 3 56.04 (25.29) 48.69 (20.12) 2747 (29.25) 26.26 (25.62) 2709 (21.82) 28.37 (2744) 49.14 (30.63) 39.49 (33.88) 

Means and SD (in parenthesis) are dispiayed. 
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Conflict effect Orienting effect 




'Bilinguals 
Monoiinguals 



Group 1 Group 2 Groups Group 1 Group 2 Groups 



Age Groups Age Groups 



Validity effect Alerting effect 




Bilinguals 
Monoiinguals 



Group 1 Group 2 Group 3 Group 1 Group 2 Group 3 

Age Groups Age Groups 



FIGURE 2 I The four indexes, representing tlie attentional networlcs, across age groups and language groups. 



Group 3: f(238) = 2.32, p < 0.03; Group 2 vs. Group 3: f(238) = 
1.71, p < 0.09]. 

In the error rate analysis, the only significant effects found were 
in Condition [_F(i_ 354) = 7.33, MSE = 8.06, p < 0.01], showing 
more errors in the Neutral Cue condition than in the Valid Cue 
condition, and Group [F(2. 354) = 34.74, MSE = 45.66, p < 0.01], 
showing a decrease in the amount of errors as a function of age. 
No other effects or interactions were significant (all _Fs < 1.1 and 
allps > 0.3). 

Reorienting: the Validity effect 

The difference between trials with a Valid Cue and trials with an 
Invalid Cue were significant in the RT analysis [main Condition 
effect: _F{i, 354) = 539.92, MSE = 888.06, p < 0.01], and the 
Group effect was also significant [-F{2, 354) = 117.92, MSE = 
13211.03, p < 0.01]. Invalid Cues produced longer response 
times than Valid Cues, and the overall response times decreased 
as a function of age. These two effects marginally interacted with 
each other [F^2, 354) = 2.78, MSE = 888.06, p < 0.07], suggest- 
ing that the magnitude of the Validity effect decreased with age. 
Follow-up f-tests showed that the magnitude of the Validity effect 
was similar for Groups 1 and 2 (54 and 56 ms, respectively; f < 1), 
and that the effect was smaller for Group 3 (44 ms) than for Group 
2 [f(238) = 2.44, p < 0.02] and, although marginally significant, 
than for Group 1 [f(238) = 1-84, p < 0.07]. Critically, the main 



effect of Language was not significant [1^(1 354) = 2.37, MSE = 
13211.03, p > 0.12], and none of the interactions involving the 
Language factor were significant either (all _Fs < 1.15 and ps > 
0.32). 

Parallel findings were also observed in the error rate analy- 
sis, showing significant Condition [F{i^ 354) = 35.60, MS-E = 9.59, 
p < 0.01] and Group effects [F(2, 354) = 37.15, MSE = 51.80, 
p < 0. 01], togetherwithamarginal interaction between these two 
factors [F(2, 354) = 3.03, MSE = 9.59, p < 0.06]. Again, no other 
effects or interactions were significant (all _Fs < 1 and allps > 0.5). 

BAYESIAN NULL HYPOTHESIS TESTING 

Given that classical hypothesis testing does not allow for accept- 
ing the null hypothesis, we tested the critical differences of 
interest following a Bayesian approach (see Rouder et al., 2009, 
among others). For each index (Conflict, Validity, Orienting 
and Alerting), we used a Bayes factor (BF) approach to com- 
pare a model that assumed no differences between bilinguals 
and monoiinguals (HO) against a model that assumed that bilin- 
guals perform differently from monoiinguals (HI). With this test, 
the null hypothesis is accepted if the resulting BF is below 0.3, 
and the alternative hypothesis is accepted if it is above 3 (see 
Kruschke, 201 1, Figure 3 in page 6). When comparing bilinguals 
and monoiinguals' Conflict effects, results favored the accep- 
tance of the nuU model {BE < 0.18). The other three attentional 
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the posterior distribution that falls inside the ROPE. Dotted line shows 
the proportion at the right edge of the highest posterior density interval 
(HDI). 



networks responded similarly, all of them being better explained 
by a null model as compared to the alternative {BF < 0.12 for 
the Orienting effect, BF < 0.21 for the Alerting effect, and BF < 
0.13 for the Validity effect). These results suggest support for the 
hypothesis of no difference. 

We further explored the reliability of the current lack of 
differences using Bayesian Parameter Estimation by testing the 
degree of confidence of the null value with the Region of 
Practical Equivalence (i.e., ROPE; see Kruschke, 2013, for details). 
Following this approach, a ROPE comprising the range of val- 
ues assumed to be statistically equal to the null value (i.e., how 
much of a difference is accepted to be considered equal to no dif- 
ferences at all) is determined by previous findings in the field. If at 
least 95% of the posterior distribution (i.e., the prior distribution 
updated by the distribution of the current data) falls within this 
ROPE, the null hypothesis should be accepted. In contrast, if 95% 
of the posterior distribution falls outside the ROPE, then the alter- 
native hypothesis should be accepted. The ROPE width would 
ideally be taken from preceding similar studies, but in this case 
there is not a consensus in the literature about the smallest mean- 
ingful difference. Therefore, we calculated the proportion of the 



posterior falling within the ROPE boundaries for a range of ROPE 
limits from 0 to 20 ms in each index (Conflict, Alerting, Orienting 
and Validity). This approach allows us to calculate the range of 
values surrounding 0 that should be accepted as equivalent to no- 
differences (i.e., the ROPE) to accept the nuU hypothesis. As seen 
in Figure 3, in order to get the 95% or more of our posterior dis- 
tributions within the ROPEs, the radii of the ROPEs need to be 
set to values ranging from 7.6 to 10 ms (10 ms for Alerting, 7.6 
for Orienting, 8.3 for Conflict and 8.5 for Validity). In essence, 
this means that, if we accept differences between 7.6 and 10 ms as 
equivalent to no differences at all, and given that then the major- 
ity of the distribution of the differences falls below these limits, we 
take this as support for the null hypothesis('). Considering that 
the differences found between bilinguals and monolinguals in the 
four indices are far below these cutoff points (4 ms for Alerting, 



It should be noted that this does not imply that any between-group differ- 
ence larger than 10 ms would significantly allow us to accept the alternative 
hypothesis. For this to be the case, at least 95% of the posterior distribution 
should lay outside the ROPE, and this would necessarily imply a much larger 
difference. 
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0 ms for Orienting, 3 ms for Conflict and 0 ms for Validity), con- 
sidering also that reliable differences in RTs of 10 ms in studies 
of children are rarely reported (note also that even in adult dif- 
ferences of 10 ms in the conflict effect between bUinguals and 
monolinguals may result in a non-significant effect (see Costa 
et al., 2009), we believe the data support the null hypothesis (no 
differences between bilinguals and monolinguals). 

GENERAL DISCUSSION 

The aim of this study was to investigate whether bilingual chil- 
dren exhibit an advantage as compared to their monolingual 
peers in the ANT task, which has been typically considered the 
paradigm best suited to explore the different attention networks. 
As described in the Introduction, different explanations have 
been given for the so-called bilingual advantage (see Green and 
Abulatebi, 2013; KroU and Bialystok, 2013); but all of them coin- 
cide in suggesting that the continuous use and control of (and 
switching between) two languages provides bUinguals with a set 
of enhanced attention skills that ultimately leads to the emergence 
of differences between monolinguals and bilinguals in differ- 
ent non-linguistic tasks closely associated with executive control. 
In light of some recent studies failing to replicate the bilingual 
advantage with different populations (e.g., Paap and Greenberg, 
2013; Dufiabeitia et al., 2014; Gathercole et al, 2014), and con- 
sidering the existing debate between researchers suggesting that 
bihnguals outperform monolinguals in the ANT task (e.g., Kapa 
and Colombo, 2013; Pelham and Abrams, 2014) and those sug- 
gesting that the bilingual advantage in this task is restricted to 
certain conditions and designs (e.g., Costa et al., 2009), we inves- 
tigated whether a large sample of bilingual children would exhibit 
better performance in this task than a group of carefully matched 
monolingual children. Our results unambiguously demonstrated 
that the so-called bilingual advantage could not be replicated in 
the ANT when a sufficiently large and well-matched group of 
bilingual and monolingual children were tested. 

Our results add to a growing body of evidence showing that 
most forms of bilingual advantage in tasks exploring attention 
skills may well be the result of uncontrolled factors (e.g., Morton 
and Harper, 2007; Paap and Greenberg, 2013; see also Paap and 
Liu, 2014, and Paap, submitted, for review) or specific condi- 
tions associated with the design and procedure (e.g., Costa et al, 
2009). Also, together with the results provided by Dunabeitia et al. 
(2014) from a large-scale study testing monolingual and bilin- 
gual children in two different versions of the Stroop task and by 
Gathercole et al. (2014), who tested a large number of Welsh- 
English bilinguals and English monolinguals in different tasks, 
these results demonstrate the clear similarity between monolin- 
gual and bilingual children in their performance in tasks with 
high executive control demands. 

We argue that if the so-called bilingual advantage were a 
consequence of bilinguals' enhanced inhibitory skills, a reduced 
Conflict effect should have been found for the bilingual group 
(i.e., smaller differences between Incongruent and Congruent tri- 
als for bilinguals than for monolinguals). This was not the case, 
and participants performed in a highly similar fashion in these 
two conditions regardless of their linguistic profile. On the other 
hand, if the previously reported bilingual advantage were the 



result of bilinguals' enhanced monitoring skills, one would have 
expected an overall difference between groups in the RTs and/or 
in the error rates (e.g., Costa et al., 2009; see also Wu and Thierry, 
2013), but again we did not find any supporting data for this claim 
(see also Dufiabeitia et al., 2014, for similar results). 

It is worth mentioning that the lack of a bilingual advantage 
in this study cannot be ascribed to a general lack of sensitiv- 
ity of our design to the specific attention network(s) that may 
underlie such a difference between bilinguals and monolinguals. 
Replicating preceding evidence from the monolingual domain, 
we have shown that bilingual and monolingual children exhib- 
ited longer latencies and higher error rates for Incongruent trials 
than for Congruent trials (namely, a significant Conflict effect). 
Similarly, a better performance of both groups was found in the 
Double Cue trials as compared to the No Cue trials (namely, a sig- 
nificant Alerting effect). Also, participants' responses to the Valid 
Cue trials were faster and more accurate than their responses to 
Central Cue (i.e., a significant Orienting effect). Finally, partici- 
pants showed longer RTs and higher error rates in trials involving 
an Invalid Cue than in trials with a Valid Cue (i.e., a significant 
Validity effect). Hence, considering that the current results fully 
replicate the indices observed in preceding studies with the ANT 
task (e.g.. Fan and Posner, 2004; Fan et al., 2005; Wang and Fan, 
2007; Ishigami and Klein, 2010; Yin et al, 2012; Mackie et al, 
2013 among many others), it is hardly possible that potential dif- 
ferences between bilinguals and monolinguals were masked due 
to a lack of statistical power of the current study (see also the 
magnitude of the _F- values at this regard). Furthermore, from 
a developmental point of view, the current study has replicated 
and extended the findings observed by Rueda et al. (2004) in a 
similar study testing a smaller group of monolingual children. 
The same developmental trend observed in that study can be 
seen here, suggesting that the Conflict effect (hence, the execu- 
tive network) is the attentional index that is most sensitive to a 
developmental change, greatly diminishing as a function of age. 
On the other hand, we see more modest changes in the Validity 
and Orienting effects (note that the interactions were marginally 
significant in spite of the sample size), and no significant changes 
in the Alerting effect as a consequence of age. 

In a nutshell, and in spite of the statistical power of the current 
study, no significant differences between bilingual and monolin- 
gual children emerged in their performance in the ANT task. 
Furthermore, when taking the Bayesian approach to test the null 
hypothesis against the alternative, the null appears as the strongest 
candidate. When the analysis was based on the ROPE approach, 
we also found support for the null hypothesis. In this analysis we 
found limits for the difference between groups that were in fact 
larger than previously reported differences in adults. 

Certainly, we want to avoid generalizing the observed lack 
of bilingual advantage to other age groups, and as already dis- 
cussed in Dunabeitia et al. (2014), our claims are exclusively 
endorsing the conclusion that the so-called bilingual advantage 
in tasks focusing on participants' attention skills is inexistent, or 
at best, extremely inconsistent and elusive. As discussed in the 
Introduction, both behavioral and neuroimaging evidence (see, 
among many others, Luk et al, 2010; Gold et al., 2013) sug- 
gest some form of bilingual advantage in similar tasks with adult 
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samples. Hence, as mentioned by KroU and Bialystok (2013), 
the existence of a bilingual advantage in adulthood cannot be 
ignored, even though the degree to which those findings can be 
generalized to all adult bilingual samples is limited (see Paap and 
Greenberg, 2013, among others). It should be considered that 
the so-called bilingual advantage may emerge as a consequence 
of lifelong bilingualism mainly in later stages of life (e.g., the 
elderly). 

Leaving aside the debate about the stability of the bilingual 
advantage in attention-related skills in adulthood, what the cur- 
rent results highlight is that the differences observed during 
young and old adulthood between monolinguals and bilinguals 
are not observed during childhood. This, together with recent 
evidence showing larger differences in older than in younger par- 
ticipants (e.g., Gold et al, 2013), suggests a highly variable nature 
of the so-called bilingual advantage, which seems to be strongly 
dependent on a number of specific factors, among which the age 
of the samples should be carefully considered in future studies. 
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